• Àüü
  • ÀüÀÚ/Àü±â
  • Åë½Å
  • ÄÄÇ»ÅÍ
´Ý±â

»çÀÌÆ®¸Ê

Loading..

Please wait....

±¹³» ³í¹®Áö

Ȩ Ȩ > ¿¬±¸¹®Çå > ±¹³» ³í¹®Áö > Çѱ¹Á¤º¸°úÇÐȸ ³í¹®Áö > Á¤º¸°úÇÐȸ³í¹®Áö (Journal of KIISE)

Á¤º¸°úÇÐȸ³í¹®Áö (Journal of KIISE)

Current Result Document :

ÇѱÛÁ¦¸ñ(Korean Title) ¹®Àå ºÐ¼®°ú ´Ü¾î »èÁ¦¸¦ ÅëÇÑ Çѱ¹¾î ¹®Àå Ãà¾à ÄÚÆÛ½º ±¸Ãà
¿µ¹®Á¦¸ñ(English Title) Building a Korean Sentence-Compression Corpus by Analyzing Sentences and Deleting Words
ÀúÀÚ(Author) ÀÌ°æÈ£   ¹Ú¿äÇÑ   ÀÌ°øÁÖ   GyoungHo Lee   Yo-Han Park   Kong Joo Lee  
¿ø¹®¼ö·Ïó(Citation) VOL 48 NO. 02 PP. 0183 ~ 0194 (2021. 02)
Çѱ۳»¿ë
(Korean Abstract)
µö ·¯´× ±â¹ÝÀÇ ¹®Àå Ãà¾à ½Ã½ºÅÛÀ» °³¹ßÇϱâ À§Çؼ­´Â ¿ø ¹®Àå-Ãà¾à ¹®ÀåÀÇ ½ÖÀ¸·Î ±¸¼ºµÈ º´·Ä ÄÚÆÛ½º°¡ ÇÊ¿äÇÏ´Ù. º» ¿¬±¸¿¡¼­ ¿ì¸®´Â ¹®Àå Ãà¾à ¾Ë°í¸®ÁòÀ» Á¦¾ÈÇÑ´Ù. Ãà¾à ¾Ë°í¸®ÁòÀÇ ±âº» Á¢±Ù ¹æ¹ýÀº ¾ÐÃà ¹®ÀåÀÇ ¹®¹ý¼ºÀ» À¯ÁöÇϸ鼭 ÀÔ·Â ¹®ÀåÀÇ ±¸¹® ÀÇÁ¸ Æ®¸®·ÎºÎÅÍ ÀϺΠ³ëµå¸¦ Áö¿ì´Â °ÍÀÌ´Ù. ¾Ë°í¸®ÁòÀº ¹®ÀåÀÇ ±¸¹® Æ®¸® Á¦¾à Á¶°Ç°ú ÀǹÌÀû Çʼö Á¤º¸¸¦ ÀÌ¿ëÇÏ¿© »èÁ¦ÇÒ ³ëµå¸¦ ¼±ÅÃÇÑ´Ù. ½Å¹®±â»çÀÇ Ã¹ ¹®Àå°ú Çìµå¶óÀο¡ ¾Ë°í¸®ÁòÀ» Àû¿ëÇÏ¿© ¾à 140,000 ½ÖÀÇ ¿ø ¹®Àå-Ãà¾à ¹®ÀåÀÇ ÄÚÆÛ½º¸¦ ±¸ÃàÇÒ ¼ö ÀÖ¾ú´Ù. Çѱ¹¾î Ãà¾à ÄÚÆÛ½ºÀÇ Ç°ÁúÀ» Æò°¡Çϱâ À§ÇÏ¿© °¡µ¶¼º°ú Á¤º¸Àü´Þ·Â¿¡ ´ëÇØ ¼öµ¿ Æò°¡¸¦ ¼öÇàÇÑ °á°ú 5Á¡ ¸¸Á¡ Áß °¡µ¶¼º 4.75, Á¤º¸Àü´Þ·Â 4.53À» ¹Þ¾Ò´Ù.
¿µ¹®³»¿ë
(English Abstract)
Developing a sentence-compression system based on deep learning models requires a parallel corpus consisting of both original sentences and compressed sentences. In this paper, we propose a sentence-compression algorithm that can compress an original sentence into a short sentence. Our basic approach is to delete nodes from a syntactic-dependency tree of the original sentence while maintaining the grammaticality of the compressed sentence. The algorithm chooses nodes to be deleted using the structural constraints and semantically obligatory information of the sentence. By applying the algorithm to the first sentences and headlines of news articles, we built a Korean sentence-compression corpus consisting of approximately 140,000 pairs. We manually assessed the quality of the compression in terms of readability and informativeness, which yielded results of 4.75 and 4.53 out of 5, respectively.
Å°¿öµå(Keyword) Çѱ¹¾î ¹®Àå Ãà¾à ÄÚÆÛ½º   º´·Ä ÄÚÆÛ½º   »èÁ¦ ±ÔÄ¢   ÀÇÁ¸ ±¸¹® Æ®¸®   Korean sentence-compression corpus   parallel corpu   deletion rules   dependencyparsing tree  
ÆÄÀÏ÷ºÎ PDF ´Ù¿î·Îµå